Abstract: A method to construct a nonasymmetric distributed source coding (DSC) scheme using polar codes is considered. The scheme can achieve any point on the dominant face of the Slepian-Wolf (SW) rate region for sources with uniform marginals. In addition to the nonasymmetric case, we also discuss and show explicitly how asymmetric and single source compression are done using a successive cancellation (SC) polar decoder. We then present simulation results that exhibit the performance of the considered methods.

Index Terms: Polar codes, Slepian-Wolf, distributed source coding, syndrome, nonasymmetric.



I. Introduction 

In this paper we present a method to achieve any point on the "dominant" face of the Slepian-Wolf (SW) achievable rate region using polar codes for the case of sources with uniform marginals. SW coding [1] refers to the distributed compression of a memoryless source pair $(X, Y) \sim p_{X,Y}(x, y)$. The problem setting assumes separate encoding but joint decoding of sources $X$ and $Y$. The surprising result of the SW theorem is that a total rate of $H(X, Y)$ is sufficient even with separate encoding. In this work we assume $p_{X,Y}$ is a distribution on $\mathcal{X} \times \mathcal{X}$, where $\mathcal{X} = \{0, 1\}$. Let $(x^N, y^N) \in \mathcal{X}^N \times \mathcal{X}^N$ be a pair of $N$-vectors obtained by repeated independent drawings of the source pair $(X, Y)$. Then, the X-encoder performs the mapping $f_X : \mathcal{X}^N \to \{1, 2, \ldots, 2^{NR_X}\}$ and the Y-encoder performs the mapping $f_Y : \mathcal{X}^N \to \{1, 2, \ldots, 2^{NR_Y}\}$. The decoder observing both of the sequences performs the mapping $g : \{1, 2, \ldots, 2^{NR_X}\} \times \{1, 2, \ldots, 2^{NR_Y}\} \to \mathcal{X}^N \times \mathcal{X}^N$. As $N \to \infty$, such mappings with vanishingly small decoding error probability exist if the rate pair $(R_X, R_Y)$ is inside the achievable rate region (SW rate region) described by the inequalities $R_X \ge H(X|Y)$, $R_Y \ge H(Y|X)$ and $R_X + R_Y \ge H(X, Y)$. The corner points $(R_X, R_Y) = (H(X|Y), H(Y))$ and $(R_X, R_Y) = (H(X), H(Y|X))$ on the achievable rate boundary are also referred to as "asymmetric" operating points. Any point on the line segment between these corner points (the "dominant" face), where $R_X + R_Y = H(X, Y)$, is referred to as a "nonasymmetric" operating point.

In the past decade, starting with the pioneering work of Pradhan and Ramchandran [2], an extensive literature on applying channel codes for practical implementation of SW coding has been developed. Schemes utilizing turbo and LDPC codes in both asymmetric [3], [4] and nonasymmetric [5], [6] SW problems were constructed. Also, "flexible rate" schemes using LDPC [7] and turbo [8], [9] codes were devised. A flexible rate code refers to a SW coding scheme which has means to vary its total rate without much performance loss. The nonasymmetric schemes in [5], [6] are based on the "channel code partitioning" idea introduced in [10]. A single channel code generator matrix is partitioned into two to be used by the X and Y encoders. The partition is done in such a way that the desired rate allocation is achieved. Another method to achieve nonasymmetric rate allocation was introduced in [11]. It uses the same channel code for both encoders. It is the method used in this paper and its details are given in Section III.



Polar coding [12], recently discovered by Arikan, is the first provably capacity-achieving coding method with low encoding and decoding complexity for the class of binary-input discrete memoryless channels. Shortly after its discovery, a number of works were published showing that polar codes are also provably optimal for source coding, asymmetric Slepian-Wolf and Wyner-Ziv problems [13]-[17]. Our main contribution in this work is to devise a simple, practical nonasymmetric SW scheme using polar codes for the case of sources with uniform marginals by utilizing the framework of [11].

The paper is organized as follows. First, we briefly discuss how to use the polar encoder / SC decoder pair in practical asymmetric SW and single source compression settings. Then, we describe how to apply the framework of [11] to polar codes to achieve nonasymmetric SW coding of uniform sources. Lastly, we present simulation results that exhibit the performance of the considered schemes. We use the SC list (SCL) decoder [18] with CRC to achieve the best possible results. The method of adding a simple CRC to the information bits and using the SCL decoder in conjunction with this CRC was proposed by the authors of [18]. The advantage of this method is that it improves the performance for short to moderate block lengths with almost no extra complexity. We assume that the reader is familiar with polar codes, especially the concepts in [12], [17]. The notations used for vectors and matrices are similar to those in [12] and [19].

II. Asymmetric Slepian-Wolf and Single Source 
Compression 

In this work, the marginals of both sources $X$ and $Y$ are assumed to be uniform. Let $Y = X \oplus Z$, where $Z \sim \mathrm{Ber}(p)$. In other words, $Y$ is the version of $X$ corrupted by a BSC($p$). Thus, $H(X|Y) = H(Y|X) = H(p)$, where $H(p) = -p \log_2(p) - (1 - p) \log_2(1 - p)$ is the binary entropy function. For the asymmetric SW setting, the side information (SI) $y^N$ is assumed to be present at the decoder error-free. This setting is then not much different from a channel decoding problem: given the observation $y^N$, which is a version of $x^N$ corrupted by a BSC($p$), recovering $x^N$ is a channel decoding problem. The only difference compared to usual channel decoding is that the coset in which the search is performed is not the zero-syndrome coset but the one given by the syndrome of $x^N$. Thus, the compression operation is nothing but calculating the syndrome $s^{N-K}$ of $x^N$. One major advantage of polar codes is that this nonstandard decoding is readily implemented with an SC decoder [12].

Fig. 1. Polar codes for asymmetric SW compression.
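As a small numerical aid, the binary entropy function $H(p)$ used in this section, and its inverse on $[0, 1/2]$, can be evaluated as in the sketch below. The function names `binary_entropy` and `inverse_binary_entropy` are illustrative choices and do not come from the paper.

```python
import math


def binary_entropy(p: float) -> float:
    """H(p) = -p*log2(p) - (1-p)*log2(1-p), with H(0) = H(1) = 0."""
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log2(p) - (1.0 - p) * math.log2(1.0 - p)


def inverse_binary_entropy(h: float, tol: float = 1e-12) -> float:
    """Return p in [0, 0.5] with H(p) = h, found by bisection.

    H is increasing on [0, 0.5], so bisection is sufficient.
    """
    lo, hi = 0.0, 0.5
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if binary_entropy(mid) < h:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2.0


if __name__ == "__main__":
    # For Y = X xor Z with Z ~ Ber(p) and uniform X, H(X|Y) = H(Y|X) = H(p).
    for p in (0.013, 0.03, 0.09):
        print(f"p = {p}: H(p) = {binary_entropy(p):.3f}")
```

The inverse is convenient for translating a target conditional entropy into the crossover probability of the virtual BSC.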

A polar code is identified by a parameter set $(N, K, A, u_{A^c})$, where $N$ is the block length, $K$ is the code dimension, $A$ is the information set of size $K$ and $u_{A^c}$ is the frozen bits vector of size $N - K$. The frozen bits identify a coset of the linear block code with generator matrix $(G_N)_A$ [12], where $(G_N)_A$ denotes the submatrix of $G_N$ formed by the rows with indices in $A$. Therefore, a parity check matrix can be found so that $u_{A^c}$ is the syndrome of this code. Since $G_N^{-1} = G_N$, the syndrome calculation is just a polar encoding operation followed by extracting the bits corresponding to the frozen indices:

$$s^{N-K} = u_{A^c} = (x^N G_N^{-1})_{A^c} = (x^N G_N)_{A^c}. \tag{1}$$
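The syndrome computation in (1) can be sketched in a few lines. The sketch assumes $G_N = B_N F^{\otimes n}$ with $F = [[1, 0], [1, 1]]$ and $B_N$ the bit-reversal permutation, as in the standard construction; the frozen index set in the usage example is a hypothetical choice for illustration, not the output of an actual code construction, and indices are 0-based.

```python
def bit_reverse(v):
    """Apply the bit-reversal permutation B_N to a length-2^n list."""
    n = len(v).bit_length() - 1
    return [v[int(format(i, f"0{n}b")[::-1], 2)] for i in range(len(v))]


def kron_transform(v):
    """Multiply the row vector v by the n-fold Kronecker power of
    F = [[1, 0], [1, 1]] over GF(2), using butterfly stages."""
    x = list(v)
    step = 1
    while step < len(x):
        for start in range(0, len(x), 2 * step):
            for j in range(start, start + step):
                x[j] ^= x[j + step]
        step *= 2
    return x


def polar_transform(v):
    """Compute v * G_N over GF(2); G_N is its own inverse."""
    return kron_transform(bit_reverse(v))


def syndrome(x, frozen):
    """Equation (1): polar-transform x and keep the frozen positions."""
    u = polar_transform(x)
    return [u[i] for i in frozen]


if __name__ == "__main__":
    x = [1, 0, 1, 1, 0, 0, 1, 0]
    frozen = [0, 1, 2, 4]  # hypothetical frozen set for N = 8
    print("syndrome:", syndrome(x, frozen))
    print("involution holds:", polar_transform(polar_transform(x)) == x)
```

Because the transform is linear, the syndrome of the XOR of two vectors equals the XOR of their syndromes.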

The encoding and decoding operations are summarized in Fig. 1. Encoding realizes the operation defined in (1). Decoding is done using the SCL decoder, which outputs estimates of both $x$ and $u$; $\hat{x}$ is the estimate of the uncompressed bit sequence. The "LLR calculation" block calculates the LLRs according to the assumed correlation between sources $X$ and $Y$, which is a BSC($p$):

$$L_i = \begin{cases} (1 - 2y_i) \log\frac{1 - p}{p}, & i = 1, \ldots, N - l_{crc}, \\ 0, & i = N - l_{crc} + 1, \ldots, N, \end{cases} \tag{2}$$

where $l_{crc}$ denotes the length of the CRC bits, if any. The CRC is used in conjunction with the SCL decoder to increase performance. If it is used, the length of the information block is reduced to $N' = N - l_{crc}$. Note that in (2), while the statistics of the first $(N - l_{crc})$ bits are known (Ber($p$)) and used for decoding, the statistics of the CRC bits are assumed to be uniform.
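The "LLR calculation" block described above can be sketched as follows. The sketch assumes the common convention $L_i = \log\big(P(x_i = 0 \mid y_i) / P(x_i = 1 \mid y_i)\big)$ with natural logarithms; the function name `bsc_llrs` and its argument layout are illustrative assumptions, not taken from the paper.

```python
import math


def bsc_llrs(y, p, l_crc=0):
    """LLRs for the decoder input under a BSC(p) correlation model.

    y     : side-information bits for the N' = N - l_crc source positions
    p     : crossover probability of the virtual BSC, 0 < p < 1
    l_crc : number of CRC positions appended after the source bits

    CRC positions have no side information, so they are treated as
    uniform and receive an LLR of zero.
    """
    scale = math.log((1.0 - p) / p)
    source_llrs = [(1 - 2 * bit) * scale for bit in y]
    return source_llrs + [0.0] * l_crc


if __name__ == "__main__":
    # Asymmetric SW: side information y is available at the decoder.
    print(bsc_llrs([0, 1, 1, 0], p=0.1, l_crc=2))
    # Single source compression: the side information is all zeros.
    print(bsc_llrs([0, 0, 0, 0], p=0.1, l_crc=2))
```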

For single source compression, the SI vector $y^N$ is replaced with the all-zeros vector $0^N$ at the input of the decoder.



III. Nonasymmetric Slepian-Wolf Compression

A method to construct a nonasymmetric SW scheme from a single channel code for the case of uniformly distributed sources using the syndrome approach was proposed in [11]. This method was recently used in [5] to construct a SW scheme that is both nonasymmetric and rate-compatible using turbo codes. In this section, we apply the method of [11] to construct a nonasymmetric SW scheme using polar codes.

Fig. 2. Encoding for nonasymmetric SW with the method of [11].

The method of [11] can be summarized as follows. Consider two uniform i.i.d. and correlated $N$-vectors $x = [x_a \; x_b]$ and $y = [y_a \; y_b]$, where $x_a$ represents the first $K$ bits and $x_b$ represents the last $N - K$ bits of vector $x$ (the same applies to $y$). Also, let $x_a = [x_a^1 \; x_a^2]$ and $y_a = [y_a^1 \; y_a^2]$, where $x_a^1$ represents the first $K_1$ bits and $x_a^2$ represents the last $K_2$ bits of $x_a$ (the same applies to $y_a$), with $K_1 + K_2 = K$. Let $G$ be the $K \times N$ generator matrix and $H = [H_a \; H_b]$ be the $(N - K) \times N$ parity check matrix of the channel code. Also assume that $H_a$ is the $(N - K) \times K$ submatrix and $H_b$ is the $(N - K) \times (N - K)$ submatrix of $H$, such that $H_b$ is nonsingular. Notice that the systematic version of a code is a special case with $H_b = I_{N-K}$. The syndromes of $x$ and $y$ are calculated as $s_x = xH^T = x_a H_a^T \oplus x_b H_b^T$ and $s_y = yH^T = y_a H_a^T \oplus y_b H_b^T$, respectively. Then, the X-encoder sends $(x_a^1, s_x)$ and the Y-encoder sends $(y_a^2, s_y)$. The information sent by both encoders is marked with the shaded regions in Fig. 2. By varying $K_1$ (keeping $K_1 + K_2 = K$ constant), different points on the SW rate region can be reached. Since $X$ and $Y$ are assumed to be uniform i.i.d. sources, $H(X) = H(Y) = 1$. Then, the code is adjusted so that $(N - K) \ge N H(X|Y)$. Thus, the total rate stays above the entropy bound:

$$NR = N[R_X + R_Y] = (K_1 + (N - K)) + (K_2 + (N - K)) = N + (N - K) \ge N[H(Y) + H(X|Y)] = N H(X, Y).$$
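The rate allocation above can be checked with a short calculation. The following sketch computes the rate pair obtained for a given split $K_1$; the function name `sw_rates` is an illustrative choice.

```python
def sw_rates(n, k, k1):
    """Rate pair (R_X, R_Y) of the nonasymmetric scheme.

    The X-encoder sends K1 systematic bits plus the (N - K)-bit syndrome;
    the Y-encoder sends K2 = K - K1 systematic bits plus its syndrome.
    """
    if not 0 <= k1 <= k <= n:
        raise ValueError("require 0 <= K1 <= K <= N")
    k2 = k - k1
    syndrome_len = n - k
    return (k1 + syndrome_len) / n, (k2 + syndrome_len) / n


if __name__ == "__main__":
    n, k = 16, 8  # syndrome rate (N - K)/N = 0.5, total rate 1.5
    for k1 in (0, 2, 4):
        r_x, r_y = sw_rates(n, k, k1)
        print(f"K1 = {k1}: (R_X, R_Y) = ({r_x}, {r_y}), sum = {r_x + r_y}")
```

With $(N - K)/N = 0.5$, the choices $K_1 = 0$, $K_1 = K/4$ and $K_1 = K/2$ give the rate pairs $(0.5, 1)$, $(0.625, 0.875)$ and $(0.75, 0.75)$, all with total rate $1.5$.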

The decoding of the above scheme, which is depicted in Fig. 3, is done as follows. Let $e = x \oplus y$ be the error vector. Then, $s_e = eH^T = (x \oplus y)H^T = s_x \oplus s_y$. The channel decoder is supplied with the all-zeros vector as input and $s_e$ as the coset index. The estimate $\hat{e}$ is obtained as the output. With this estimated error pattern, $x_a^2$ and $y_a^1$ can be recovered using $y_a^2$ and $x_a^1$, respectively. Finally, $x_b$ and $y_b$ are obtained as

$$x_b = (s_x \oplus x_a H_a^T)(H_b^T)^{-1}, \tag{3}$$

$$y_b = (s_y \oplus y_a H_a^T)(H_b^T)^{-1}. \tag{4}$$

Note that, although it is not shown explicitly in Fig. 3, the likelihood calculation for the all-zeros vector input to the decoder is done using the crossover parameter $p$ of the virtual BSC between sources $X$ and $Y$, as given in (2).

Fig. 3. Decoding for nonasymmetric SW with the method of [11].
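The encoding and decoding steps of this section can be illustrated on a toy example. The sketch below is not the polar-code scheme of the paper: it substitutes a systematic (7,4) Hamming code (so $H_b = I$) and a brute-force minimum-weight coset search for the channel decoder, purely to show how the pieces fit together. All function names are illustrative.

```python
from itertools import product

N, K = 7, 4
# H = [H_a | H_b] with H_b = I (systematic special case), (7,4) Hamming code.
H_A = [[1, 1, 0, 1],
       [1, 0, 1, 1],
       [0, 1, 1, 1]]


def xor(a, b):
    return [i ^ j for i, j in zip(a, b)]


def times_HaT(v_a):
    """Row vector v_a (length K) times H_a^T over GF(2)."""
    return [sum(h * b for h, b in zip(row, v_a)) % 2 for row in H_A]


def syndrome(v):
    """v H^T = v_a H_a^T xor v_b, because H_b = I."""
    return xor(times_HaT(v[:K]), v[K:])


def coset_leader(s):
    """Brute-force stand-in for the channel decoder: the minimum-weight
    vector whose syndrome equals s."""
    vectors = (list(e) for e in product((0, 1), repeat=N))
    return min((e for e in vectors if syndrome(e) == s), key=sum)


def encode(x, y, k1):
    """X sends (first K1 bits of x_a, s_x); Y sends (last K2 bits of y_a, s_y)."""
    return (x[:k1], syndrome(x)), (y[k1:K], syndrome(y))


def decode(x_msg, y_msg, k1):
    (x1, s_x), (y2, s_y) = x_msg, y_msg
    e = coset_leader(xor(s_x, s_y))   # estimate of e = x xor y
    y1 = xor(x1, e[:k1])              # missing part of y_a
    x2 = xor(y2, e[k1:K])             # missing part of x_a
    x_a, y_a = x1 + x2, y1 + y2
    x_b = xor(s_x, times_HaT(x_a))    # equation (3) with H_b = I
    y_b = xor(s_y, times_HaT(y_a))    # equation (4) with H_b = I
    return x_a + x_b, y_a + y_b


if __name__ == "__main__":
    x = [1, 0, 1, 1, 0, 1, 0]
    y = [1, 0, 1, 1, 0, 0, 0]  # differs from x in one position
    x_hat, y_hat = decode(*encode(x, y, k1=2), k1=2)
    print(x_hat == x, y_hat == y)
```

Since the Hamming code corrects a single error, recovery is exact whenever $x$ and $y$ differ in at most one position, for any split $K_1$.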

Now, this method cannot be applied to the standard form of polar codes in [12]. However, the recently introduced systematic version of polar codes [19] can be used. Systematic polar coding is defined as follows. The codeword $x$ is split into two parts $x = (x_B, x_{B^c})$, where $B$ is a subset of $\{1, \ldots, N\}$ such that $|B| = |A|$ and the submatrix $G_{AB}$ is invertible. $G_{AB}$ consists of the array of elements $(G_{i,j})$ with $i \in A$ and $j \in B$. Then, the "systematic" and "parity" parts of the codeword $x$ can be written as

$$x_B = u_A G_{AB} \oplus u_{A^c} G_{A^c B}, \tag{5}$$

$$x_{B^c} = u_A G_{A B^c} \oplus u_{A^c} G_{A^c B^c}, \tag{6}$$

respectively. The difference between this definition and the usual definition of a systematic code is that the systematic bits do not constitute the first $K$ bits of the codeword $x$ but rather a different subset of locations identified by the index set $B$. With the above definitions, a systematic encoder defined with parameter $(B, u_{A^c})$ can implement the mapping $x_B \mapsto x = (x_B, x_{B^c})$ by computing

$$u_A = (x_B \oplus u_{A^c} G_{A^c B})(G_{AB})^{-1} \tag{7}$$

and then inserting $u_A$ into (6) to obtain $x_{B^c}$.
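The systematic encoding steps (5)-(7) can be sketched for a small block length. As a simplifying assumption, the sketch uses $G = F^{\otimes n}$ without the bit-reversal permutation, in which case $G$ is lower triangular with a unit diagonal, $B = A$ is a valid choice, and (7) reduces to back-substitution. The index sets in the usage example are hypothetical, indices are 0-based, and this direct matrix approach is only for illustration (it is not the efficient SC-decoder-based encoder of [19]).

```python
def kron_power(n):
    """n-fold Kronecker power of F = [[1, 0], [1, 1]] as a list of rows."""
    G = [[1]]
    for _ in range(n):
        size = len(G)
        G = [row + [0] * size for row in G] + [row + row for row in G]
    return G


def systematic_encode(x_sys, info_set, u_frozen, n):
    """Find u and x = u G with x restricted to info_set equal to x_sys.

    x_sys    : desired systematic bits, ordered by increasing index
    info_set : index set A (used here also as B)
    u_frozen : frozen bits u_{A^c}, ordered by increasing index
    """
    size = 1 << n
    G = kron_power(n)
    info = sorted(info_set)
    frozen = [i for i in range(size) if i not in info]
    u = [0] * size
    for i, bit in zip(frozen, u_frozen):
        u[i] = bit
    wanted = dict(zip(info, x_sys))
    # G is lower triangular: x_j = u_j xor sum_{i > j} u_i G[i][j],
    # so u_j follows by back-substitution from the largest index down.
    for j in reversed(info):
        acc = wanted[j]
        for i in range(j + 1, size):
            acc ^= u[i] & G[i][j]
        u[j] = acc
    x = [sum(u[i] & G[i][j] for i in range(size)) % 2 for j in range(size)]
    return u, x


if __name__ == "__main__":
    A = [3, 5, 6, 7]  # hypothetical information set for N = 8
    u, x = systematic_encode([1, 0, 1, 1], A, [0, 1, 0, 1], n=3)
    print("u =", u)
    print("x =", x)
```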

Returning to the nonasymmetric SW method of [11] described above, we set $x_a = x_B$, $x_b = x_{B^c}$ and $s_x = u_{A^c}$. This way we fulfill the requirement of the method that, once $x_a$ is decoded using the estimated error vector $\hat{e}$, the rest, $x_b$, can be recovered from $x_a$ and $s_x$. Given $x_a = x_B$ and $s_x = u_{A^c}$, computing $x_b = x_{B^c}$ and $u_A$ is nothing but the systematic polar encoding operation summarized above, and it can be done efficiently using an SC polar decoder [19]. For the standard form of polar codes defined in [12], the index set $B$ can be selected as the permuted version of $A$, where the permutation corresponds to the bit-reversal operation.

The use of CRC with this nonasymmetric scheme is also possible. For both sources, information blocks of length $N' = N - l_{crc}$ are completed to length $N$ with $l_{crc}$ bits of CRC. Since the CRC operation is linear, the CRC of the error vector $e = x \oplus y$ must also check. Thus, the SCL channel decoder can use this information when estimating the error vector.
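The linearity argument can be verified numerically. The sketch below assumes a 16-bit CRC with the CCITT polynomial $x^{16} + x^{12} + x^5 + 1$, a zero initial register and no final XOR, which makes the CRC a linear map over GF(2); the exact CRC variant used in the simulations is not specified in the text, and the function names are illustrative.

```python
CRC16_CCITT_POLY = 0x1021  # x^16 + x^12 + x^5 + 1


def crc16(bits, poly=CRC16_CCITT_POLY):
    """Bitwise CRC-16 with zero initial register and no final XOR."""
    reg = 0
    for b in bits:
        feedback = ((reg >> 15) & 1) ^ b
        reg = (reg << 1) & 0xFFFF
        if feedback:
            reg ^= poly
    return [(reg >> (15 - i)) & 1 for i in range(16)]


def xor(a, b):
    return [i ^ j for i, j in zip(a, b)]


def append_crc(bits):
    """Complete an information block with its CRC bits."""
    return list(bits) + crc16(bits)


def crc_checks(word):
    """True if the last 16 bits are the CRC of the preceding bits."""
    return crc16(word[:-16]) == word[-16:]


if __name__ == "__main__":
    x = append_crc([1, 0, 1, 1, 0, 0, 1, 0, 1, 1])
    y = append_crc([1, 0, 0, 1, 0, 0, 1, 0, 1, 0])
    e = xor(x, y)
    print("CRC of e = x xor y checks:", crc_checks(e))
```

With a nonzero initial register or a final XOR the map is affine rather than linear, and the constant offset cancels in the XOR of two equal-length words.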
Fig. 4. Asymmetric SW bit error rate (Rx = 0.5).

IV. Simulation Results 

In this section, we present simulation results on the performance of the source coding methods discussed. The correlation model between sources $X$ and $Y$ is given as $Y = X \oplus Z$, where $Z \sim \mathrm{Ber}(p)$. In all of the plots, the rates of the codes are kept at a fixed value while $p$ is varied to achieve different $H(X|Y)$ points. For all of the SW schemes, the plotted BER is the average over the $X$ and $Y$ sources. The polar decoder used is the SCL decoder of [18]. To improve the performance, a 16-bit CRC (CCITT) is added. The list decoder selects the output from the final list with the aid of the CRC. Note that, for source coding, the CRC is appended to the "codeword" vector $x$, as opposed to the channel coding case where it is appended to the "information" vector $u_A$. The list size is set to 32 for all cases. The code construction is done via the method proposed in [20] and optimized for $p = 0.09$ for $R = 0.5$, $p = 0.03$ for $R = 0.25$ and $p = 0.013$ for $R = 0.125$.

A. Asymmetric Slepian-Wolf and Single Source Compression 

As pointed out in Section II, single source coding can be considered a special case of asymmetric SW coding. Single source compression of an i.i.d. source $X$ with $H(X) = H(p)$ and asymmetric SW coding of an i.i.d. pair $(X, Y)$ with uniform marginals and $H(X|Y) = H(Y|X) = H(p)$ are essentially the same. Therefore, their simulation performances come out to be identical, as expected. In Fig. 4, BER curves of the asymmetric SW setting for $R_X = 0.5$ are shown. Results for different code block lengths and rate values are given in Table I. The values in the table show the entropy of the source correlation ($H(p) = H(X|Y)$) at which a BER of $10^{-5}$ is achieved. For example, the gap to the SW bound for $R_X = 0.5$ and $N = 65536$ is 0.056.

TABLE I

Asymmetrical SW performance

(H(X|Y) values for a BER of 10^-5)

Rx \ N     2048     4096     16384    65536    131072
0.5        0.360    0.386    0.423    0.444    0.453
0.25       0.118    0.153    0.190    0.205    0.210
0.125      0.032    0.044    0.075    0.096    0.101
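The gap to the SW bound quoted in the text is simply the code rate minus the achieved $H(X|Y)$. The sketch below recomputes it for every entry of Table I; the dictionary layout and the function name `gap_to_bound` are illustrative choices.

```python
# Values copied from Table I: achieved H(X|Y) at a BER of 1e-5.
TABLE_I = {
    0.5:   {2048: 0.360, 4096: 0.386, 16384: 0.423, 65536: 0.444, 131072: 0.453},
    0.25:  {2048: 0.118, 4096: 0.153, 16384: 0.190, 65536: 0.205, 131072: 0.210},
    0.125: {2048: 0.032, 4096: 0.044, 16384: 0.075, 65536: 0.096, 131072: 0.101},
}


def gap_to_bound(rate, achieved_entropy):
    """Gap between the rate R_X and the achieved H(X|Y), to 3 decimals."""
    return round(rate - achieved_entropy, 3)


if __name__ == "__main__":
    for rate, row in TABLE_I.items():
        gaps = {n: gap_to_bound(rate, h) for n, h in row.items()}
        print(f"R_X = {rate}: {gaps}")
```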
Fig. 5. Nonasymmetric method for N = 65536 together with the SW bound.

B. Nonasymmetric Slepian-Wolf Compression 

The performance of the nonasymmetric scheme for $H(p) = 0.5$ and $N = 65536$ is presented in Fig. 5. Results for different block lengths are given in Table II. A BER of $10^{-5}$ is considered to be lossless when determining the rate points. When the rates are allocated so that the asymmetric setting $(R_X, R_Y) = (0.5, 1)$ results, the performance is identical to that of the asymmetric method of Section II given in Table I (a gap of 0.056). This is expected, because when the asymmetric rate allocation is made in the nonasymmetric method, $y^N$ is recovered perfectly. Also, only the syndrome vector is sent from the X-encoder. Thus, for the asymmetric rate allocation, the method of Section III reduces to the method of Section II. Two other operating points, corresponding to the symmetric rate $(0.75, 0.75)$ and a nonasymmetric rate $(0.625, 0.875)$, are also marked on Fig. 5. The performance at these rates is slightly inferior to the asymmetric case. This is expected: in the asymmetric case no error is made for the source $Y$, whereas in the nonasymmetric cases the estimation of $Y$ is also prone to errors, and these errors propagate to the recovery of $X$. Also note that the performance is the same for all nonasymmetric points.

V. Conclusion 

We started by reviewing asymmetric SW and single source compression using polar codes. Then, using the general framework of [11], we showed how polar codes can be used for constructing a nonasymmetric SW scheme for uniform sources. A successive cancellation list decoder with the addition of a 16-bit CRC is used to achieve the best performance. Although the performance is very good, it is slightly inferior to the best performance reported in the literature using turbo and LDPC codes [3], [6], [9]. However, there are some advantages to using polar codes. No modification is needed to the SC decoder, since syndrome decoding is readily implemented. The length of the syndrome can be modified easily and incrementally, which gives rise to flexible rate adaptation. This might be an incentive to use polar codes under varying correlation conditions. Also, exploring the advantages of polar codes over turbo and LDPC codes in terms of complexity and latency remains a future research topic.



TABLE II

Nonasymmetrical SW performance for R = 1.5

(H(X, Y) values for a BER of 10^-5)

(Rx, Ry) \ N        2048     4096     16384    65536
(0.500, 1.000)      1.361    1.388    1.424    1.444
(0.625, 0.875)      1.321    1.349    1.402    1.435
(0.750, 0.750)      1.321    1.349    1.402    1.435